Dataset statistics
| Number of variables | 79 |
|---|---|
| Number of observations | 10100 |
| Missing cells | 519148 |
| Missing cells (%) | 65.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 611.0 B |
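As a quick sanity check, the missing-cell percentage can be recomputed from the counts in the table. The sketch below assumes that the total number of cells is simply variables times observations; the variable names are illustrative and the figures are copied from the table.

```python
# Figures copied from the dataset statistics table.
n_variables = 79
n_observations = 10100
missing_cells = 519148

# Assumption: every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Rounded to one decimal place, the result agrees with the reported 65.1%.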
Variable types
| Categorical | 9 |
|---|---|
| DateTime | 1 |
| Boolean | 47 |
| Numeric | 22 |
Warnings
| Warning | Type |
|---|---|
| chills has constant value "True" | Constant |
| cough has constant value "True" | Constant |
| diarrhoea has constant value "True" | Constant |
| event_admission has constant value "True" | Constant |
| event_enrolment has constant value "True" | Constant |
| feeling_faint has constant value "True" | Constant |
| sore_throat has constant value "True" | Constant |
| spleen_palpation has constant value "False" | Constant |
| study_no has a high cardinality: 664 distinct values | High cardinality |
| haematocrit is highly correlated with haematocrit_percent and 7 other fields | High correlation |
| haematocrit_percent is highly correlated with haematocrit and 2 other fields | High correlation |
| haemoglobin is highly correlated with haematocrit and 2 other fields | High correlation |
| joint_pain_level is highly correlated with muscle_pain_level | High correlation |
| liver_size is highly correlated with haematocrit_percent and 3 other fields | High correlation |
| lymphocytes_percent is highly correlated with haematocrit | High correlation |
| monocytes_percent is highly correlated with haematocrit | High correlation |
| muscle_pain_level is highly correlated with joint_pain_level | High correlation |
| neutrophils_percent is highly correlated with haematocrit | High correlation |
| plt is highly correlated with haematocrit | High correlation |
| wbc is highly correlated with haematocrit and 1 other field | High correlation |
| weight is highly correlated with haematocrit and 1 other field | High correlation |
| abdominal_tenderness has 7725 (76.5%) missing values | Missing |
| albumin has 7831 (77.5%) missing values | Missing |
| alt has 7838 (77.6%) missing values | Missing |
| ascites has 7725 (76.5%) missing values | Missing |
| ast has 7840 (77.6%) missing values | Missing |
| bleeding has 7725 (76.5%) missing values | Missing |
| bleeding_gum has 7723 (76.5%) missing values | Missing |
| bleeding_nose has 7723 (76.5%) missing values | Missing |
| bleeding_other has 10082 (99.8%) missing values | Missing |
| bleeding_vaginal has 7723 (76.5%) missing values | Missing |
| body_temperature has 7453 (73.8%) missing values | Missing |
| bruising has 7723 (76.5%) missing values | Missing |
| chills has 9490 (94.0%) missing values | Missing |
| clinical_shock has 7725 (76.5%) missing values | Missing |
| conjunctival has 7725 (76.5%) missing values | Missing |
| convulsions has 7725 (76.5%) missing values | Missing |
| cough has 9738 (96.4%) missing values | Missing |
| creatine_kinase has 7842 (77.6%) missing values | Missing |
| diarrhoea has 9818 (97.2%) missing values | Missing |
| event_admission has 9436 (93.4%) missing values | Missing |
| event_enrolment has 9436 (93.4%) missing values | Missing |
| feeling_faint has 9636 (95.4%) missing values | Missing |
| fetal has 9536 (94.4%) missing values | Missing |
| fibrinogen has 7873 (78.0%) missing values | Missing |
| haematocrit has 10057 (99.6%) missing values | Missing |
| haematocrit_percent has 5743 (56.9%) missing values | Missing |
| haemoglobin has 5743 (56.9%) missing values | Missing |
| hematemesis has 7723 (76.5%) missing values | Missing |
| hematuria has 7723 (76.5%) missing values | Missing |
| inr has 7835 (77.6%) missing values | Missing |
| jaundice has 7725 (76.5%) missing values | Missing |
| lethargy has 7725 (76.5%) missing values | Missing |
| liver_palpation has 7725 (76.5%) missing values | Missing |
| lymphadenopathy has 7725 (76.5%) missing values | Missing |
| lymphocytes_percent has 5744 (56.9%) missing values | Missing |
| melaena has 7723 (76.5%) missing values | Missing |
| meningism has 7725 (76.5%) missing values | Missing |
| monocytes_percent has 5744 (56.9%) missing values | Missing |
| nausea has 9603 (95.1%) missing values | Missing |
| neurology has 7725 (76.5%) missing values | Missing |
| neutrophils_percent has 5743 (56.9%) missing values | Missing |
| oedema has 7370 (73.0%) missing values | Missing |
| oedema_face has 7723 (76.5%) missing values | Missing |
| oedema_feet has 7723 (76.5%) missing values | Missing |
| oedema_hands has 7723 (76.5%) missing values | Missing |
| petechiae has 7723 (76.5%) missing values | Missing |
| pharyngeal has 7725 (76.5%) missing values | Missing |
| pleural_effusion has 7725 (76.5%) missing values | Missing |
| plt has 5744 (56.9%) missing values | Missing |
| pulse has 7453 (73.8%) missing values | Missing |
| rales_crackles has 7725 (76.5%) missing values | Missing |
| respiratory_distress has 7725 (76.5%) missing values | Missing |
| respiratory_rate has 7453 (73.8%) missing values | Missing |
| restlessness has 7725 (76.5%) missing values | Missing |
| rhinitis has 7725 (76.5%) missing values | Missing |
| sbp has 7453 (73.8%) missing values | Missing |
| skin_flush has 7725 (76.5%) missing values | Missing |
| skin_rash has 7725 (76.5%) missing values | Missing |
| sore_throat has 9912 (98.1%) missing values | Missing |
| spleen_palpation has 7725 (76.5%) missing values | Missing |
| tck has 7874 (78.0%) missing values | Missing |
| tq has 7870 (77.9%) missing values | Missing |
| uterus_tender has 9173 (90.8%) missing values | Missing |
| vagina_loss has 9168 (90.8%) missing values | Missing |
| wbc has 5743 (56.9%) missing values | Missing |
| weight has 6669 (66.0%) missing values | Missing |
| study_no is uniformly distributed | Uniform |
| bleeding_other is uniformly distributed | Uniform |
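The "Constant" and "Missing" entries can be illustrated with a small check on a single column. This is a minimal sketch, not the profiler's own implementation: the function name `column_warnings`, the use of `None` for a missing cell, and the 1% missing-share threshold are all assumptions made for illustration.

```python
def column_warnings(name, values, missing_threshold=0.01):
    """Return warning strings for one column; None marks a missing cell."""
    warnings = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)

    # Constant: exactly one distinct non-missing value.
    if len(set(present)) == 1:
        warnings.append(f'{name} has constant value "{present[0]}"')

    # Missing: share of missing cells above the (assumed) threshold.
    if values and n_missing / len(values) > missing_threshold:
        pct = 100 * n_missing / len(values)
        warnings.append(f"{name} has {n_missing} ({pct:.1f}%) missing values")
    return warnings


# Rebuild the chills column from its reported counts: 610 True, 9490 missing.
chills = [True] * 610 + [None] * 9490
for message in column_warnings("chills", chills):
    print(message)
```

With the counts reported for chills, the sketch yields one constant-value message and one missing-values message, matching the two entries listed for that variable.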
Reproduction
| Analysis started | 2021-01-24 10:50:15.601118 |
|---|---|
| Analysis finished | 2021-01-24 10:51:09.697329 |
| Duration | 54.1 seconds |
| Software version | pandas-profiling v2.10.0 |
| Download configuration | config.yaml |
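The reported duration can be checked against the two timestamps. The sketch below only parses the values shown in the table; the format string is an assumption based on how the timestamps are printed.

```python
from datetime import datetime

# Timestamp layout as printed in the Reproduction table (assumed format).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-01-24 10:50:15.601118", FMT)
finished = datetime.strptime("2021-01-24 10:51:09.697329", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```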
study_no
Categorical
| Distinct | 664 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| 03-5431 | 27 |
|---|---|
| 03-5425 | 27 |
| 03-5010 | 26 |
| 03-5007 | 25 |
| 03-5443 | 25 |
| Other values (659) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 70700 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 03-1001 |
|---|---|
| 2nd row | 03-1001 |
| 3rd row | 03-1001 |
| 4th row | 03-1001 |
| 5th row | 03-1001 |
| Value | Count | Frequency (%) |
| 03-5431 | 27 | 0.3% |
| 03-5425 | 27 | 0.3% |
| 03-5010 | 26 | 0.3% |
| 03-5007 | 25 | 0.2% |
| 03-5443 | 25 | 0.2% |
| 03-5381 | 24 | 0.2% |
| 03-5360 | 24 | 0.2% |
| 03-5407 | 23 | 0.2% |
| 03-5260 | 23 | 0.2% |
| 03-5202 | 23 | 0.2% |
| Other values (654) | 9853 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14510 | |
| 3 | 13680 | |
| 5 | 12781 | |
| - | 10100 | |
| 1 | 4465 | 6.3% |
| 4 | 3730 | 5.3% |
| 2 | 3573 | 5.1% |
| 6 | 2075 | 2.9% |
| 9 | 1945 | 2.8% |
| 8 | 1930 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 60600 | |
| Dash Punctuation | 10100 | 14.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 14510 | |
| 3 | 13680 | |
| 5 | 12781 | |
| 1 | 4465 | 7.4% |
| 4 | 3730 | 6.2% |
| 2 | 3573 | 5.9% |
| 6 | 2075 | 3.4% |
| 9 | 1945 | 3.2% |
| 8 | 1930 | 3.2% |
| 7 | 1911 | 3.2% |
| Value | Count | Frequency (%) |
| - | 10100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 70700 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 14510 | |
| 3 | 13680 | |
| 5 | 12781 | |
| - | 10100 | |
| 1 | 4465 | 6.3% |
| 4 | 3730 | 5.3% |
| 2 | 3573 | 5.1% |
| 6 | 2075 | 2.9% |
| 9 | 1945 | 2.8% |
| 8 | 1930 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70700 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 14510 | |
| 3 | 13680 | |
| 5 | 12781 | |
| - | 10100 | |
| 1 | 4465 | 6.3% |
| 4 | 3730 | 5.3% |
| 2 | 3573 | 5.1% |
| 6 | 2075 | 2.9% |
| 9 | 1945 | 2.8% |
| 8 | 1930 | 2.7% |
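The category names in these tables (Decimal Number, Dash Punctuation) correspond to Unicode general categories. As a minimal sketch of how such counts could be obtained, the helper below (a hypothetical name, not part of the profiler) tallies characters by category using the standard `unicodedata` module, where `Nd` stands for Decimal Number and `Pd` for Dash Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# One study_no value taken from the sample rows.
print(dict(category_counts(["03-1001"])))
```

Each study_no value has six digits and one dash, which is consistent with the 60600 Decimal Number and 10100 Dash Punctuation characters reported for 10100 rows.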
date
Date
| Distinct | 4523 |
|---|---|
| Distinct (%) | 44.9% |
| Missing | 27 |
| Missing (%) | 0.3% |
| Memory size | 79.0 KiB |
| Minimum | 2016-10-05 00:00:00 |
|---|---|
| Maximum | 2018-05-17 00:00:00 |
abdominal_pain_level
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 1.0 | 279 |
| 2.0 | 45 |
| 3.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 30300 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nan |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 9774 | |
| 1.0 | 279 | 2.8% |
| 2.0 | 45 | 0.4% |
| 3.0 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 19548 | |
| a | 9774 | |
| . | 326 | 1.1% |
| 0 | 326 | 1.1% |
| 1 | 279 | 0.9% |
| 2 | 45 | 0.1% |
| 3 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29322 | |
| Decimal Number | 652 | 2.2% |
| Other Punctuation | 326 | 1.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 326 | |
| 1 | 279 | |
| 2 | 45 | 6.9% |
| 3 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| n | 19548 | |
| a | 9774 |
| Value | Count | Frequency (%) |
| . | 326 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29322 | |
| Common | 978 | 3.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| . | 326 | |
| 0 | 326 | |
| 1 | 279 | |
| 2 | 45 | 4.6% |
| 3 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| n | 19548 | |
| a | 9774 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30300 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 19548 | |
| a | 9774 | |
| . | 326 | 1.1% |
| 0 | 326 | 1.1% |
| 1 | 279 | 0.9% |
| 2 | 45 | 0.1% |
| 3 | 2 | < 0.1% |
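The abdominal_pain_level tables report Missing as 0 although 9774 rows hold the literal value nan. One plausible reading (an assumption, the source does not say) is that the column was converted to text before profiling, so NaN became the string "nan" and the variable was typed as Categorical. A minimal sketch of recovering the numeric values and the true missing count, with the column rebuilt from the reported counts:

```python
import math


def parse_level(text):
    """Convert a profiled string such as '1.0' or 'nan' to a float."""
    return float(text)  # float('nan') yields a real NaN


# Rebuild the column from the reported value counts.
raw = ["nan"] * 9774 + ["1.0"] * 279 + ["2.0"] * 45 + ["3.0"] * 2
levels = [parse_level(v) for v in raw]

# Count the values that are genuinely missing once parsed.
n_missing = sum(math.isnan(x) for x in levels)
print(f"{n_missing} missing ({100 * n_missing / len(levels):.1f}%)")
```

Under that assumption the variable is numeric with three observed levels, and almost all rows are missing rather than belonging to a category called nan.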
abdominal_tenderness
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 283 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2092 | 20.7% |
| True | 283 | 2.8% |
| (Missing) | 7725 |
age
Real number (ℝ≥0)
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.87108911 |
|---|---|
| Minimum | 18 |
| Maximum | 45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 23 |
| median | 27 |
| Q3 | 30 |
| 95-th percentile | 35 |
| Maximum | 45 |
| Range | 27 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.720488742 |
|---|---|
| Coefficient of variation (CV) | 0.1756716567 |
| Kurtosis | -0.264299352 |
| Mean | 26.87108911 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.2444607218 |
| Sum | 271398 |
| Variance | 22.28301396 |
| Monotonicity | Not monotonic |
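Several of the derived statistics for age follow directly from the others. The sketch below recomputes them from the values in the tables above, assuming the usual definitions (CV as standard deviation over mean, IQR as Q3 minus Q1, variance as the squared standard deviation); the variable names are illustrative.

```python
# Values copied from the age tables.
mean = 26.87108911
std = 4.720488742
q1, q3 = 23, 30
minimum, maximum = 18, 45

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance from the standard deviation

print(f"CV={cv:.4f} IQR={iqr} range={value_range} variance={variance:.3f}")
```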
| Value | Count | Frequency (%) |
| 27 | 918 | 9.1% |
| 25 | 893 | 8.8% |
| 26 | 789 | 7.8% |
| 30 | 737 | 7.3% |
| 28 | 724 | 7.2% |
| 23 | 703 | 7.0% |
| 24 | 678 | 6.7% |
| 29 | 565 | 5.6% |
| 33 | 565 | 5.6% |
| 32 | 430 | 4.3% |
| Other values (15) | 3098 |
| Value | Count | Frequency (%) |
| 18 | 247 | |
| 19 | 394 | |
| 20 | 377 | |
| 21 | 421 | |
| 22 | 393 |
| Value | Count | Frequency (%) |
| 45 | 17 | |
| 41 | 12 | |
| 40 | 19 | |
| 39 | 29 | |
| 38 | 27 |
albumin
Real number (ℝ≥0)
| Distinct | 225 |
|---|---|
| Distinct (%) | 9.9% |
| Missing | 7831 |
| Missing (%) | 77.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.87831644 |
|---|---|
| Minimum | 23.5 |
| Maximum | 51.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 23.5 |
|---|---|
| 5-th percentile | 30.1 |
| Q1 | 34.6 |
| median | 38.1 |
| Q3 | 41.3 |
| 95-th percentile | 45.1 |
| Maximum | 51.3 |
| Range | 27.8 |
| Interquartile range (IQR) | 6.7 |
Descriptive statistics
| Standard deviation | 4.598959566 |
|---|---|
| Coefficient of variation (CV) | 0.1214140437 |
| Kurtosis | -0.4630757061 |
| Mean | 37.87831644 |
| Median Absolute Deviation (MAD) | 3.3 |
| Skewness | -0.1403309986 |
| Sum | 85945.9 |
| Variance | 21.15042909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 35.4 | 28 | 0.3% |
| 40 | 24 | 0.2% |
| 40.4 | 24 | 0.2% |
| 39.3 | 24 | 0.2% |
| 38.8 | 23 | 0.2% |
| 35.2 | 23 | 0.2% |
| 39.4 | 23 | 0.2% |
| 34.2 | 22 | 0.2% |
| 41.6 | 22 | 0.2% |
| 38.4 | 22 | 0.2% |
| Other values (215) | 2034 | 20.1% |
| (Missing) | 7831 |
| Value | Count | Frequency (%) |
| 23.5 | 1 | |
| 23.8 | 1 | |
| 24.4 | 1 | |
| 24.7 | 1 | |
| 25.7 | 1 |
| Value | Count | Frequency (%) |
| 51.3 | 1 | |
| 50.2 | 1 | |
| 49.6 | 1 | |
| 49.5 | 1 | |
| 49.1 | 1 |
alt
Real number (ℝ≥0)
| Distinct | 420 |
|---|---|
| Distinct (%) | 18.6% |
| Missing | 7838 |
| Missing (%) | 77.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.80910698 |
|---|---|
| Minimum | 5 |
| Maximum | 988 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 25 |
| median | 47 |
| Q3 | 100.825 |
| 95-th percentile | 255 |
| Maximum | 988 |
| Range | 983 |
| Interquartile range (IQR) | 75.825 |
Descriptive statistics
| Standard deviation | 97.41557869 |
|---|---|
| Coefficient of variation (CV) | 1.190766924 |
| Kurtosis | 16.88983276 |
| Mean | 81.80910698 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 3.3831445 |
| Sum | 185052.2 |
| Variance | 9489.794972 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 46 | 0.5% |
| 18 | 42 | 0.4% |
| 19 | 39 | 0.4% |
| 27 | 39 | 0.4% |
| 15 | 35 | 0.3% |
| 17 | 35 | 0.3% |
| 26 | 35 | 0.3% |
| 14 | 34 | 0.3% |
| 16 | 33 | 0.3% |
| 33 | 33 | 0.3% |
| Other values (410) | 1891 | 18.7% |
| (Missing) | 7838 |
| Value | Count | Frequency (%) |
| 5 | 4 | < 0.1% |
| 7 | 5 | < 0.1% |
| 7.1 | 1 | < 0.1% |
| 8 | 11 | |
| 9 | 13 |
| Value | Count | Frequency (%) |
| 988 | 1 | |
| 971 | 1 | |
| 811 | 2 | |
| 710 | 1 | |
| 689 | 2 |
anorexia
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.0 KiB |
| True | |
|---|---|
| False | 767 |
| Value | Count | Frequency (%) |
| True | 9333 | |
| False | 767 | 7.6% |
ascites
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 1 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2374 | 23.5% |
| True | 1 | < 0.1% |
| (Missing) | 7725 |
ast
Real number (ℝ≥0)
| Distinct | 471 |
|---|---|
| Distinct (%) | 20.8% |
| Missing | 7840 |
| Missing (%) | 77.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.5814602 |
|---|---|
| Minimum | 11 |
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 17.58 |
| Q1 | 30 |
| median | 58 |
| Q3 | 120.325 |
| 95-th percentile | 339.05 |
| Maximum | 1000 |
| Range | 989 |
| Interquartile range (IQR) | 90.325 |
Descriptive statistics
| Standard deviation | 128.111302 |
|---|---|
| Coefficient of variation (CV) | 1.248873839 |
| Kurtosis | 14.57590997 |
| Mean | 102.5814602 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 3.354431735 |
| Sum | 231834.1 |
| Variance | 16412.50569 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29 | 45 | 0.4% |
| 24 | 41 | 0.4% |
| 19 | 39 | 0.4% |
| 23 | 38 | 0.4% |
| 26 | 38 | 0.4% |
| 22 | 38 | 0.4% |
| 31 | 38 | 0.4% |
| 21 | 37 | 0.4% |
| 28 | 35 | 0.3% |
| 43 | 31 | 0.3% |
| Other values (461) | 1880 | 18.6% |
| (Missing) | 7840 |
| Value | Count | Frequency (%) |
| 11 | 2 | < 0.1% |
| 12 | 9 | |
| 12.9 | 1 | < 0.1% |
| 13 | 7 | |
| 13.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000 | 1 | |
| 988 | 1 | |
| 982 | 1 | |
| 946 | 1 | |
| 943 | 1 |
bleeding
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| True | |
|---|---|
| False | |
| (Missing) |
| Value | Count | Frequency (%) |
| True | 1452 | 14.4% |
| False | 923 | 9.1% |
| (Missing) | 7725 |
bleeding_gum
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 98 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2279 | 22.6% |
| True | 98 | 1.0% |
| (Missing) | 7723 |
bleeding_mucosal
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.0 KiB |
| False | |
|---|---|
| True | 169 |
| Value | Count | Frequency (%) |
| False | 9931 | |
| True | 169 | 1.7% |
bleeding_nose
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 41 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2336 | 23.1% |
| True | 41 | 0.4% |
| (Missing) | 7723 |
bleeding_other
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | 88.9% |
| Missing | 10082 |
| Missing (%) | 99.8% |
| Memory size | 79.0 KiB |
| FEW VAGINAL DISCHARGE | |
|---|---|
| VAGINAL LEAKAGE DUE TO MISCARRIAGE | 1 |
| HEMOTYPSIE | 1 |
| RECOVERY RASH ON 2 LEGS , SMALL AMOUNT OF BROWN FLUID FROM VAGINA | 1 |
| BLEEDING FROM HEMORRHOIDS | 1 |
| Other values (11) |
Length
| Max length | 65 |
|---|---|
| Median length | 28 |
| Mean length | 27 |
| Min length | 6 |
Characters and Unicode
| Total characters | 486 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 15 |
|---|---|
| Unique (%) | 83.3% |
Sample
| 1st row | VAGINAL LEAKAGE DUE TO MISCARRIAGE |
|---|---|
| 2nd row | HEMOTYPSY |
| 3rd row | SMALL AMOUNT OF BROWN FLUID FROM VAGINA |
| 4th row | RECOVERY RASH ON 2 LEGS , SMALL AMOUNT OF BROWN FLUID FROM VAGINA |
| 5th row | CONJUNCTIVA BLEEDING IN THE RIGHT EYE. |
| Value | Count | Frequency (%) |
| FEW VAGINAL DISCHARGE | 3 | < 0.1% |
| VAGINAL LEAKAGE DUE TO MISCARRIAGE | 1 | < 0.1% |
| HEMOTYPSIE | 1 | < 0.1% |
| RECOVERY RASH ON 2 LEGS , SMALL AMOUNT OF BROWN FLUID FROM VAGINA | 1 | < 0.1% |
| BLEEDING FROM HEMORRHOIDS | 1 | < 0.1% |
| HEMOTYPSY | 1 | < 0.1% |
| CONJUNCTIVA BLEEDING IN THE RIGHT EYE. | 1 | < 0.1% |
| LOCHIA | 1 | < 0.1% |
| MILD HEMOPTYSIS | 1 | < 0.1% |
| LOCHIA REDUCED | 1 | < 0.1% |
| Other values (6) | 6 | 0.1% |
| (Missing) | 10082 |
| Value | Count | Frequency (%) |
| bleeding | 7 | 9.2% |
| in | 6 | 7.9% |
| eye | 5 | 6.6% |
| conjunctival | 4 | 5.3% |
| vaginal | 4 | 5.3% |
| right | 4 | 5.3% |
| 2 | 3 | 3.9% |
| discharge | 3 | 3.9% |
| from | 3 | 3.9% |
| few | 3 | 3.9% |
| Other values (25) | 34 |
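The word table above counts tokens across the free-text bleeding_other entries. A minimal sketch of such a count, assuming lower-casing and whitespace splitting (the profiler's actual tokenisation is not stated in the report), applied to three of the listed values:

```python
from collections import Counter


def word_counts(texts):
    """Count lower-cased, whitespace-separated words across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


# Three entries copied from the bleeding_other value table.
samples = [
    "FEW VAGINAL DISCHARGE",
    "VAGINAL LEAKAGE DUE TO MISCARRIAGE",
    "BLEEDING FROM HEMORRHOIDS",
]
counts = word_counts(samples)
print(counts["vaginal"], counts["bleeding"])
```

Whitespace splitting leaves punctuation attached to words, so a value such as "EYE." would be counted separately from "EYE" unless punctuation is stripped first.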
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 58 | 11.9% |
| E | 48 | 9.9% |
| I | 42 | 8.6% |
| N | 36 | 7.4% |
| A | 32 | 6.6% |
| L | 26 | 5.3% |
| O | 25 | 5.1% |
| G | 23 | 4.7% |
| C | 20 | 4.1% |
| R | 20 | 4.1% |
| Other values (17) | 156 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 421 | |
| Space Separator | 58 | 11.9% |
| Other Punctuation | 4 | 0.8% |
| Decimal Number | 3 | 0.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 48 | 11.4% |
| I | 42 | 10.0% |
| N | 36 | 8.6% |
| A | 32 | 7.6% |
| L | 26 | 6.2% |
| O | 25 | 5.9% |
| G | 23 | 5.5% |
| C | 20 | 4.8% |
| R | 20 | 4.8% |
| T | 18 | 4.3% |
| Other values (13) | 131 |
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 1 | 25.0% |
| Value | Count | Frequency (%) |
| (space) | 58 |
| Value | Count | Frequency (%) |
| 2 | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 421 | |
| Common | 65 | 13.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 48 | 11.4% |
| I | 42 | 10.0% |
| N | 36 | 8.6% |
| A | 32 | 7.6% |
| L | 26 | 6.2% |
| O | 25 | 5.9% |
| G | 23 | 5.5% |
| C | 20 | 4.8% |
| R | 20 | 4.8% |
| T | 18 | 4.3% |
| Other values (13) | 131 |
| Value | Count | Frequency (%) |
| (space) | 58 | |
| 2 | 3 | 4.6% |
| . | 3 | 4.6% |
| , | 1 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 486 |
Most frequent character per block
| Value | Count | Frequency (%) |
| (space) | 58 | 11.9% |
| E | 48 | 9.9% |
| I | 42 | 8.6% |
| N | 36 | 7.4% |
| A | 32 | 6.6% |
| L | 26 | 5.3% |
| O | 25 | 5.1% |
| G | 23 | 4.7% |
| C | 20 | 4.1% |
| R | 20 | 4.1% |
| Other values (17) | 156 |
bleeding_skin
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.0 KiB |
| False | |
|---|---|
| True | 136 |
| Value | Count | Frequency (%) |
| False | 9964 | |
| True | 136 | 1.3% |
bleeding_vaginal
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 264 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2113 | 20.9% |
| True | 264 | 2.6% |
| (Missing) | 7723 |
body_temperature
Real number (ℝ≥0)
| Distinct | 39 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 7453 |
| Missing (%) | 73.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.69852663 |
|---|---|
| Minimum | 36.7 |
| Maximum | 41 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 36.7 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 37.1 |
| median | 37.3 |
| Q3 | 38 |
| 95-th percentile | 39.5 |
| Maximum | 41 |
| Range | 4.3 |
| Interquartile range (IQR) | 0.9 |
Descriptive statistics
| Standard deviation | 0.8759498147 |
|---|---|
| Coefficient of variation (CV) | 0.02323565117 |
| Kurtosis | 0.5322576867 |
| Mean | 37.69852663 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 1.299861569 |
| Sum | 99788 |
| Variance | 0.7672880778 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37 | 633 | 6.3% |
| 37.2 | 426 | 4.2% |
| 37.3 | 234 | 2.3% |
| 37.1 | 230 | 2.3% |
| 39 | 200 | 2.0% |
| 38 | 156 | 1.5% |
| 37.4 | 147 | 1.5% |
| 37.5 | 100 | 1.0% |
| 39.5 | 81 | 0.8% |
| 38.5 | 73 | 0.7% |
| Other values (29) | 367 | 3.6% |
| (Missing) | 7453 |
| Value | Count | Frequency (%) |
| 36.7 | 1 | < 0.1% |
| 36.8 | 2 | < 0.1% |
| 36.9 | 3 | < 0.1% |
| 37 | 633 | |
| 37.1 | 230 | 2.3% |
| Value | Count | Frequency (%) |
| 41 | 4 | |
| 40.6 | 1 | < 0.1% |
| 40.5 | 2 | |
| 40.4 | 2 | |
| 40.3 | 2 |
bruising
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 741 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 1636 | 16.2% |
| True | 741 | 7.3% |
| (Missing) | 7723 |
chills
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9490 |
| Missing (%) | 94.0% |
| Memory size | 79.0 KiB |
| True | 610 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 610 | 6.0% |
| (Missing) | 9490 |
clinical_shock
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 10 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2365 | 23.4% |
| True | 10 | 0.1% |
| (Missing) | 7725 |
conjunctival
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 330 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2045 | 20.2% |
| True | 330 | 3.3% |
| (Missing) | 7725 |
convulsions
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 2 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2373 | 23.5% |
| True | 2 | < 0.1% |
| (Missing) | 7725 |
cough
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 9738 |
| Missing (%) | 96.4% |
| Memory size | 79.0 KiB |
| True | 362 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 362 | 3.6% |
| (Missing) | 9738 |
creatine_kinase
Real number (ℝ≥0)
| Distinct | 273 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 7842 |
| Missing (%) | 77.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.84588131 |
|---|---|
| Minimum | 8 |
| Maximum | 1404 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 34 |
| median | 49 |
| Q3 | 75.75 |
| 95-th percentile | 204.15 |
| Maximum | 1404 |
| Range | 1396 |
| Interquartile range (IQR) | 41.75 |
Descriptive statistics
| Standard deviation | 99.67445541 |
|---|---|
| Coefficient of variation (CV) | 1.33172933 |
| Kurtosis | 56.07149742 |
| Mean | 74.84588131 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | 6.311617762 |
| Sum | 169002 |
| Variance | 9934.997061 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37 | 48 | 0.5% |
| 35 | 48 | 0.5% |
| 40 | 48 | 0.5% |
| 31 | 45 | 0.4% |
| 42 | 44 | 0.4% |
| 34 | 43 | 0.4% |
| 46 | 41 | 0.4% |
| 27 | 40 | 0.4% |
| 28 | 38 | 0.4% |
| 45 | 38 | 0.4% |
| Other values (263) | 1825 | 18.1% |
| (Missing) | 7842 |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 4 | |
| 12 | 1 | < 0.1% |
| 13 | 2 |
| Value | Count | Frequency (%) |
| 1404 | 1 | |
| 1347 | 1 | |
| 1264 | 1 | |
| 1136 | 1 | |
| 967 | 1 |
dbp
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 60.0 | |
| 55.0 | 2 |
| 50.0 | 2 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.155148515 |
| Min length | 3 |
Characters and Unicode
| Total characters | 31867 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nan |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 8533 | |
| 60.0 | 1563 | 15.5% |
| 55.0 | 2 | < 0.1% |
| 50.0 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 17066 | |
| a | 8533 | |
| 0 | 3132 | 9.8% |
| . | 1567 | 4.9% |
| 6 | 1563 | 4.9% |
| 5 | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25599 | |
| Decimal Number | 4701 | 14.8% |
| Other Punctuation | 1567 | 4.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 3132 | |
| 6 | 1563 | |
| 5 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| n | 17066 | |
| a | 8533 |
| Value | Count | Frequency (%) |
| . | 1567 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25599 | |
| Common | 6268 | 19.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 3132 | |
| . | 1567 | |
| 6 | 1563 | |
| 5 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| n | 17066 | |
| a | 8533 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31867 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 17066 | |
| a | 8533 | |
| 0 | 3132 | 9.8% |
| . | 1567 | 4.9% |
| 6 | 1563 | 4.9% |
| 5 | 6 | < 0.1% |
diarrhoea
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 9818 |
| Missing (%) | 97.2% |
| Memory size | 79.0 KiB |
| True | 282 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 282 | 2.8% |
| (Missing) | 9818 |
event_admission
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9436 |
| Missing (%) | 93.4% |
| Memory size | 79.0 KiB |
| True | 664 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 664 | 6.6% |
| (Missing) | 9436 |
event_enrolment
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9436 |
| Missing (%) | 93.4% |
| Memory size | 79.0 KiB |
| True | 664 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 664 | 6.6% |
| (Missing) | 9436 |
feeling_faint
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9636 |
| Missing (%) | 95.4% |
| Memory size | 79.0 KiB |
| True | 464 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 464 | 4.6% |
| (Missing) | 9636 |
fetal
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 9536 |
| Missing (%) | 94.4% |
| Memory size | 79.0 KiB |
| True | 560 |
|---|---|
| False | 4 |
| (Missing) |
| Value | Count | Frequency (%) |
| True | 560 | 5.5% |
| False | 4 | < 0.1% |
| (Missing) | 9536 |
fibrinogen
Real number (ℝ≥0)
| Distinct | 352 |
|---|---|
| Distinct (%) | 15.8% |
| Missing | 7873 |
| Missing (%) | 78.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.893385721 |
|---|---|
| Minimum | 0.46 |
| Maximum | 7.64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 0.46 |
|---|---|
| 5-th percentile | 1.82 |
| Q1 | 2.34 |
| median | 2.77 |
| Q3 | 3.285 |
| 95-th percentile | 4.43 |
| Maximum | 7.64 |
| Range | 7.18 |
| Interquartile range (IQR) | 0.945 |
Descriptive statistics
| Standard deviation | 0.8309164426 |
|---|---|
| Coefficient of variation (CV) | 0.2871779026 |
| Kurtosis | 2.767363604 |
| Mean | 2.893385721 |
| Median Absolute Deviation (MAD) | 0.46 |
| Skewness | 1.213297924 |
| Sum | 6443.57 |
| Variance | 0.6904221346 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.76 | 27 | 0.3% |
| 2.89 | 26 | 0.3% |
| 2.85 | 25 | 0.2% |
| 2.67 | 25 | 0.2% |
| 2.57 | 24 | 0.2% |
| 2.47 | 21 | 0.2% |
| 2.46 | 21 | 0.2% |
| 3.29 | 21 | 0.2% |
| 2.54 | 21 | 0.2% |
| 2.83 | 21 | 0.2% |
| Other values (342) | 1995 | 19.8% |
| (Missing) | 7873 |
| Value | Count | Frequency (%) |
| 0.46 | 1 | |
| 0.89 | 1 | |
| 1.03 | 1 | |
| 1.06 | 1 | |
| 1.12 | 1 |
| Value | Count | Frequency (%) |
| 7.64 | 1 | |
| 6.97 | 1 | |
| 6.85 | 2 | |
| 6.59 | 1 | |
| 6.42 | 1 |
haematocrit
Real number (ℝ≥0)
| Distinct | 19 |
|---|---|
| Distinct (%) | 44.2% |
| Missing | 10057 |
| Missing (%) | 99.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.58139535 |
|---|---|
| Minimum | 30 |
| Maximum | 55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 40 |
| median | 43 |
| Q3 | 48.5 |
| 95-th percentile | 52 |
| Maximum | 55 |
| Range | 25 |
| Interquartile range (IQR) | 8.5 |
Descriptive statistics
| Standard deviation | 5.807353951 |
|---|---|
| Coefficient of variation (CV) | 0.1332530522 |
| Kurtosis | -0.3344634513 |
| Mean | 43.58139535 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.1323457476 |
| Sum | 1874 |
| Variance | 33.72535991 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42 | 4 | < 0.1% |
| 45 | 4 | < 0.1% |
| 49 | 4 | < 0.1% |
| 41 | 4 | < 0.1% |
| 38 | 3 | < 0.1% |
| 40 | 3 | < 0.1% |
| 48 | 2 | < 0.1% |
| 37 | 2 | < 0.1% |
| 50 | 2 | < 0.1% |
| 44 | 2 | < 0.1% |
| Other values (9) | 13 | 0.1% |
| (Missing) | 10057 |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 35 | 2 | |
| 37 | 2 | |
| 38 | 3 |
| Value | Count | Frequency (%) |
| 55 | 1 | |
| 54 | 1 | |
| 52 | 2 | |
| 51 | 1 | |
| 50 | 2 |
haematocrit_percent
Real number (ℝ≥0)
| Distinct | 240 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 5743 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.33218958 |
|---|---|
| Minimum | 22.8 |
| Maximum | 54.15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 22.8 |
|---|---|
| 5-th percentile | 30.3 |
| Q1 | 34.6 |
| median | 37.5 |
| Q3 | 40 |
| 95-th percentile | 43.9 |
| Maximum | 54.15 |
| Range | 31.35 |
| Interquartile range (IQR) | 5.4 |
Descriptive statistics
| Standard deviation | 4.14872744 |
|---|---|
| Coefficient of variation (CV) | 0.1111300325 |
| Kurtosis | 0.1905244926 |
| Mean | 37.33218958 |
| Median Absolute Deviation (MAD) | 2.7 |
| Skewness | -0.06158883624 |
| Sum | 162656.35 |
| Variance | 17.21193937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38.4 | 57 | 0.6% |
| 36.1 | 53 | 0.5% |
| 37.6 | 52 | 0.5% |
| 39 | 52 | 0.5% |
| 37.2 | 52 | 0.5% |
| 39.5 | 51 | 0.5% |
| 37.1 | 50 | 0.5% |
| 38.7 | 50 | 0.5% |
| 36.7 | 49 | 0.5% |
| 39.3 | 49 | 0.5% |
| Other values (230) | 3842 | |
| (Missing) | 5743 | 56.9% |
| Value | Count | Frequency (%) |
| 22.8 | 1 | |
| 23.2 | 1 | |
| 23.6 | 1 | |
| 24.8 | 2 | |
| 25.4 | 2 |
| Value | Count | Frequency (%) |
| 54.15 | 1 | < 0.1% |
| 52.5 | 3 | |
| 52.4 | 1 | < 0.1% |
| 52.2 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| Distinct | 105 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 5743 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.33187055 |
|---|---|
| Minimum | 6.8 |
| Maximum | 18.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 6.8 |
|---|---|
| 5-th percentile | 9.98 |
| Q1 | 11.4 |
| median | 12.4 |
| Q3 | 13.2 |
| 95-th percentile | 14.5 |
| Maximum | 18.5 |
| Range | 11.7 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.392986883 |
|---|---|
| Coefficient of variation (CV) | 0.1129582797 |
| Kurtosis | 0.3528813618 |
| Mean | 12.33187055 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | -0.064014948 |
| Sum | 53729.96 |
| Variance | 1.940412455 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.5 | 157 | 1.6% |
| 12.7 | 144 | 1.4% |
| 12.3 | 140 | 1.4% |
| 12.9 | 135 | 1.3% |
| 13 | 132 | 1.3% |
| 12 | 125 | 1.2% |
| 12.2 | 123 | 1.2% |
| 12.4 | 122 | 1.2% |
| 12.8 | 119 | 1.2% |
| 12.6 | 118 | 1.2% |
| Other values (95) | 3042 | |
| (Missing) | 5743 | 56.9% |
| Value | Count | Frequency (%) |
| 6.8 | 1 | |
| 7.1 | 1 | |
| 7.4 | 1 | |
| 7.44 | 1 | |
| 7.5 | 1 |
| Value | Count | Frequency (%) |
| 18.5 | 1 | |
| 18 | 1 | |
| 17.6 | 1 | |
| 17.4 | 1 | |
| 17.1 | 2 |
headache_level
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 2.0 | 393 |
| 1.0 | 150 |
| 3.0 | 97 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 30300 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 9460 | |
| 2.0 | 393 | 3.9% |
| 1.0 | 150 | 1.5% |
| 3.0 | 97 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 18920 | |
| a | 9460 | |
| . | 640 | 2.1% |
| 0 | 640 | 2.1% |
| 2 | 393 | 1.3% |
| 1 | 150 | 0.5% |
| 3 | 97 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28380 | |
| Decimal Number | 1280 | 4.2% |
| Other Punctuation | 640 | 2.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 640 | |
| 2 | 393 | |
| 1 | 150 | 11.7% |
| 3 | 97 | 7.6% |
| Value | Count | Frequency (%) |
| n | 18920 | |
| a | 9460 |
| Value | Count | Frequency (%) |
| . | 640 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28380 | |
| Common | 1920 | 6.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| . | 640 | |
| 0 | 640 | |
| 2 | 393 | |
| 1 | 150 | 7.8% |
| 3 | 97 | 5.1% |
| Value | Count | Frequency (%) |
| n | 18920 | |
| a | 9460 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30300 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 18920 | |
| a | 9460 | |
| . | 640 | 2.1% |
| 0 | 640 | 2.1% |
| 2 | 393 | 1.3% |
| 1 | 150 | 0.5% |
| 3 | 97 | 0.3% |
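
The tables above report 0 missing values for headache_level while listing `nan` as its most frequent category, which suggests the column was profiled as text. The following is a hedged Python sketch of how those `nan` strings could be treated as missing instead. The helper `parse_level` is hypothetical and not part of the report or of any profiling library; the counts are copied from the frequency table above.

```python
import math
from collections import Counter


def parse_level(raw):
    """Return an integer level, or None when the cell is the literal string 'nan'."""
    value = float(raw)  # float("nan") parses to a NaN float
    return None if math.isnan(value) else int(value)


# Counts copied from the headache_level frequency table.
raw_counts = {"nan": 9460, "2.0": 393, "1.0": 150, "3.0": 97}

levels = Counter()
for raw, count in raw_counts.items():
    levels[parse_level(raw)] += count

# Separate the missing cells from the real levels.
missing = levels.pop(None)
missing_pct = 100 * missing / sum(raw_counts.values())

print(dict(levels), missing, round(missing_pct, 1))
```

Under this reading the variable has three real levels covering 640 rows, with the remaining rows missing rather than belonging to a fourth category.
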
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 5 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2372 | 23.5% |
| True | 5 | < 0.1% |
| (Missing) | 7723 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 7 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2370 | 23.5% |
| True | 7 | 0.1% |
| (Missing) | 7723 | 76.5% |
| Distinct | 44 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 7835 |
| Missing (%) | 77.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.024379691 |
|---|---|
| Minimum | 1 |
| Maximum | 4.04 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1.01 |
| 95-th percentile | 1.14 |
| Maximum | 4.04 |
| Range | 3.04 |
| Interquartile range (IQR) | 0.01 |
Descriptive statistics
| Standard deviation | 0.08710537835 |
|---|---|
| Coefficient of variation (CV) | 0.08503231675 |
| Kurtosis | 640.767858 |
| Mean | 1.024379691 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.77231261 |
| Sum | 2320.22 |
| Variance | 0.007587346937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1663 | 16.5% |
| 1.02 | 82 | 0.8% |
| 1.01 | 70 | 0.7% |
| 1.04 | 48 | 0.5% |
| 1.03 | 43 | 0.4% |
| 1.05 | 39 | 0.4% |
| 1.09 | 37 | 0.4% |
| 1.06 | 34 | 0.3% |
| 1.07 | 30 | 0.3% |
| 1.1 | 29 | 0.3% |
| Other values (34) | 190 | 1.9% |
| (Missing) | 7835 | 77.6% |
| Value | Count | Frequency (%) |
| 1 | 1663 | |
| 1.01 | 70 | 0.7% |
| 1.02 | 82 | 0.8% |
| 1.03 | 43 | 0.4% |
| 1.04 | 48 | 0.5% |
| Value | Count | Frequency (%) |
| 4.04 | 1 | |
| 1.87 | 1 | |
| 1.52 | 1 | |
| 1.48 | 1 | |
| 1.44 | 1 |
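
The very high skewness (19.77) and kurtosis (640.77) reported for this variable are in line with a small number of large values, such as the single 4.04 in the table of largest values. One rough way to screen for such values is Tukey's rule, which flags anything above Q3 + 1.5 × IQR. This rule is an assumption introduced here for illustration and is not something the report itself applies; the quartiles and extreme values are copied from the tables above.

```python
# Quartiles copied from the quantile statistics table for this variable.
q1 = 1.0
q3 = 1.01
iqr = q3 - q1

# Tukey's upper fence: values above it are flagged as potential outliers.
upper_fence = q3 + 1.5 * iqr

# The five largest values listed in the report.
largest_values = [4.04, 1.87, 1.52, 1.48, 1.44]
flagged = [v for v in largest_values if v > upper_fence]

print(round(upper_fence, 3), flagged)
```

Because the IQR is only 0.01, the fence sits very close to the median, so the rule also flags many ordinary values here. It is best read as a coarse first pass rather than a definitive outlier test.
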
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 3 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2372 | 23.5% |
| True | 3 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 2.0 | 356 |
| 1.0 | 150 |
| 3.0 | 50 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 30300 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 9544 | |
| 2.0 | 356 | 3.5% |
| 1.0 | 150 | 1.5% |
| 3.0 | 50 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 19088 | |
| a | 9544 | |
| . | 556 | 1.8% |
| 0 | 556 | 1.8% |
| 2 | 356 | 1.2% |
| 1 | 150 | 0.5% |
| 3 | 50 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28632 | |
| Decimal Number | 1112 | 3.7% |
| Other Punctuation | 556 | 1.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 556 | |
| 2 | 356 | |
| 1 | 150 | 13.5% |
| 3 | 50 | 4.5% |
| Value | Count | Frequency (%) |
| n | 19088 | |
| a | 9544 |
| Value | Count | Frequency (%) |
| . | 556 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28632 | |
| Common | 1668 | 5.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| . | 556 | |
| 0 | 556 | |
| 2 | 356 | |
| 1 | 150 | 9.0% |
| 3 | 50 | 3.0% |
| Value | Count | Frequency (%) |
| n | 19088 | |
| a | 9544 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30300 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 19088 | |
| a | 9544 | |
| . | 556 | 1.8% |
| 0 | 556 | 1.8% |
| 2 | 356 | 1.2% |
| 1 | 150 | 0.5% |
| 3 | 50 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 6 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2369 | 23.5% |
| True | 6 | 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 50 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2325 | 23.0% |
| True | 50 | 0.5% |
| (Missing) | 7725 | 76.5% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 2.0 | 28 |
| 1.0 | 19 |
| 18.0 | 2 |
| 14.0 | 1 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.00029703 |
| Min length | 3 |
Characters and Unicode
| Total characters | 30303 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | nan |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 10050 | |
| 2.0 | 28 | 0.3% |
| 1.0 | 19 | 0.2% |
| 18.0 | 2 | < 0.1% |
| 14.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 20100 | |
| a | 10050 | |
| . | 50 | 0.2% |
| 0 | 50 | 0.2% |
| 2 | 28 | 0.1% |
| 1 | 22 | 0.1% |
| 8 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30150 | |
| Decimal Number | 103 | 0.3% |
| Other Punctuation | 50 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 50 | |
| 2 | 28 | |
| 1 | 22 | |
| 8 | 2 | 1.9% |
| 4 | 1 | 1.0% |
| Value | Count | Frequency (%) |
| n | 20100 | |
| a | 10050 |
| Value | Count | Frequency (%) |
| . | 50 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30150 | |
| Common | 153 | 0.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| . | 50 | |
| 0 | 50 | |
| 2 | 28 | |
| 1 | 22 | |
| 8 | 2 | 1.3% |
| 4 | 1 | 0.7% |
| Value | Count | Frequency (%) |
| n | 20100 | |
| a | 10050 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30303 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 20100 | |
| a | 10050 | |
| . | 50 | 0.2% |
| 0 | 50 | 0.2% |
| 2 | 28 | 0.1% |
| 1 | 22 | 0.1% |
| 8 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 18 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2357 | 23.3% |
| True | 18 | 0.2% |
| (Missing) | 7725 | 76.5% |
| Distinct | 667 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 5744 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.00003214 |
|---|---|
| Minimum | 1.4 |
| Maximum | 77.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 1.4 |
|---|---|
| 5-th percentile | 10.9 |
| Q1 | 21.6 |
| median | 32.05 |
| Q3 | 43 |
| 95-th percentile | 59.1 |
| Maximum | 77.9 |
| Range | 76.5 |
| Interquartile range (IQR) | 21.4 |
Descriptive statistics
| Standard deviation | 14.63303001 |
|---|---|
| Coefficient of variation (CV) | 0.44342472 |
| Kurtosis | -0.4589838512 |
| Mean | 33.00003214 |
| Median Absolute Deviation (MAD) | 10.65 |
| Skewness | 0.3311404581 |
| Sum | 143748.14 |
| Variance | 214.1255673 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.5 | 23 | 0.2% |
| 21.6 | 19 | 0.2% |
| 42.9 | 19 | 0.2% |
| 28.8 | 18 | 0.2% |
| 32.4 | 17 | 0.2% |
| 36.4 | 17 | 0.2% |
| 21.5 | 16 | 0.2% |
| 33.9 | 16 | 0.2% |
| 33.4 | 16 | 0.2% |
| 23.6 | 16 | 0.2% |
| Other values (657) | 4179 | |
| (Missing) | 5744 | 56.9% |
| Value | Count | Frequency (%) |
| 1.4 | 1 | |
| 1.5 | 1 | |
| 2.1 | 1 | |
| 3.2 | 1 | |
| 3.9 | 1 |
| Value | Count | Frequency (%) |
| 77.9 | 1 | |
| 77 | 1 | |
| 76.4 | 1 | |
| 75.3 | 1 | |
| 74.7 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 3 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2374 | 23.5% |
| True | 3 | < 0.1% |
| (Missing) | 7723 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 2 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2373 | 23.5% |
| True | 2 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 215 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 5744 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.396997245 |
|---|---|
| Minimum | 0.3 |
| Maximum | 42.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 3.4 |
| Q1 | 5.3 |
| median | 6.8 |
| Q3 | 8.9 |
| 95-th percentile | 13.1 |
| Maximum | 42.8 |
| Range | 42.5 |
| Interquartile range (IQR) | 3.6 |
Descriptive statistics
| Standard deviation | 3.236674433 |
|---|---|
| Coefficient of variation (CV) | 0.4375659914 |
| Kurtosis | 10.68496379 |
| Mean | 7.396997245 |
| Median Absolute Deviation (MAD) | 1.7 |
| Skewness | 2.005638965 |
| Sum | 32221.32 |
| Variance | 10.47606138 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.4 | 84 | 0.8% |
| 6.6 | 78 | 0.8% |
| 5.5 | 78 | 0.8% |
| 5.9 | 77 | 0.8% |
| 5.8 | 76 | 0.8% |
| 5.1 | 76 | 0.8% |
| 5.4 | 75 | 0.7% |
| 6.9 | 74 | 0.7% |
| 6.3 | 74 | 0.7% |
| 6.2 | 72 | 0.7% |
| Other values (205) | 3592 | |
| (Missing) | 5744 | 56.9% |
| Value | Count | Frequency (%) |
| 0.3 | 1 | |
| 0.8 | 1 | |
| 1 | 1 | |
| 1.3 | 1 | |
| 1.5 | 2 |
| Value | Count | Frequency (%) |
| 42.8 | 1 | |
| 37 | 1 | |
| 35.6 | 1 | |
| 35.3 | 1 | |
| 30.3 | 1 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 2.0 | 381 |
| 1.0 | 166 |
| 3.0 | 56 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 30300 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 9497 | |
| 2.0 | 381 | 3.8% |
| 1.0 | 166 | 1.6% |
| 3.0 | 56 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 18994 | |
| a | 9497 | |
| . | 603 | 2.0% |
| 0 | 603 | 2.0% |
| 2 | 381 | 1.3% |
| 1 | 166 | 0.5% |
| 3 | 56 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28491 | |
| Decimal Number | 1206 | 4.0% |
| Other Punctuation | 603 | 2.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 603 | |
| 2 | 381 | |
| 1 | 166 | 13.8% |
| 3 | 56 | 4.6% |
| Value | Count | Frequency (%) |
| n | 18994 | |
| a | 9497 |
| Value | Count | Frequency (%) |
| . | 603 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28491 | |
| Common | 1809 | 6.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| . | 603 | |
| 0 | 603 | |
| 2 | 381 | |
| 1 | 166 | 9.2% |
| 3 | 56 | 3.1% |
| Value | Count | Frequency (%) |
| n | 18994 | |
| a | 9497 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30300 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 18994 | |
| a | 9497 | |
| . | 603 | 2.0% |
| 0 | 603 | 2.0% |
| 2 | 381 | 1.3% |
| 1 | 166 | 0.5% |
| 3 | 56 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 9603 |
| Missing (%) | 95.1% |
| Memory size | 79.0 KiB |
| True | 496 |
|---|---|
| False | 1 |
| (Missing) |
| Value | Count | Frequency (%) |
| True | 496 | 4.9% |
| False | 1 | < 0.1% |
| (Missing) | 9603 | 95.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 5 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2370 | 23.5% |
| True | 5 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 746 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 5743 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.80841175 |
|---|---|
| Minimum | 6.1 |
| Maximum | 96.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 6.1 |
|---|---|
| 5-th percentile | 24.1 |
| Q1 | 38.6 |
| median | 53.1 |
| Q3 | 67.2 |
| 95-th percentile | 80.9 |
| Maximum | 96.8 |
| Range | 90.7 |
| Interquartile range (IQR) | 28.6 |
Descriptive statistics
| Standard deviation | 17.73432742 |
|---|---|
| Coefficient of variation (CV) | 0.3358239121 |
| Kurtosis | -0.8557399342 |
| Mean | 52.80841175 |
| Median Absolute Deviation (MAD) | 14.3 |
| Skewness | -0.05554027401 |
| Sum | 230086.25 |
| Variance | 314.5063692 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52.3 | 15 | 0.1% |
| 52.9 | 14 | 0.1% |
| 45.6 | 14 | 0.1% |
| 47.8 | 14 | 0.1% |
| 50.7 | 14 | 0.1% |
| 53.3 | 14 | 0.1% |
| 66.1 | 14 | 0.1% |
| 57.1 | 14 | 0.1% |
| 57.5 | 14 | 0.1% |
| 64.9 | 13 | 0.1% |
| Other values (736) | 4217 | |
| (Missing) | 5743 | 56.9% |
| Value | Count | Frequency (%) |
| 6.1 | 1 | |
| 6.3 | 1 | |
| 7.9 | 1 | |
| 11.1 | 1 | |
| 11.3 | 1 |
| Value | Count | Frequency (%) |
| 96.8 | 1 | |
| 94.9 | 1 | |
| 93.4 | 1 | |
| 92 | 1 | |
| 91.9 | 2 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7370 |
| Missing (%) | 73.0% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 109 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2621 | 26.0% |
| True | 109 | 1.1% |
| (Missing) | 7370 | 73.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 5 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2372 | 23.5% |
| True | 5 | < 0.1% |
| (Missing) | 7723 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 6 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2371 | 23.5% |
| True | 6 | 0.1% |
| (Missing) | 7723 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 2 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2375 | 23.5% |
| True | 2 | < 0.1% |
| (Missing) | 7723 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7723 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 1413 | 14.0% |
| True | 964 | 9.5% |
| (Missing) | 7723 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 208 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2167 | 21.5% |
| True | 208 | 2.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 3 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2372 | 23.5% |
| True | 3 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 486 |
|---|---|
| Distinct (%) | 11.2% |
| Missing | 5744 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.0206382 |
|---|---|
| Minimum | 3 |
| Maximum | 619 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 59 |
| median | 101 |
| Q3 | 158 |
| 95-th percentile | 346 |
| Maximum | 619 |
| Range | 616 |
| Interquartile range (IQR) | 99 |
Descriptive statistics
| Standard deviation | 97.83918779 |
|---|---|
| Coefficient of variation (CV) | 0.776374324 |
| Kurtosis | 2.681058686 |
| Mean | 126.0206382 |
| Median Absolute Deviation (MAD) | 48 |
| Skewness | 1.585932398 |
| Sum | 548945.9 |
| Variance | 9572.506667 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 96 | 36 | 0.4% |
| 70 | 35 | 0.3% |
| 87 | 34 | 0.3% |
| 27 | 33 | 0.3% |
| 83 | 33 | 0.3% |
| 69 | 33 | 0.3% |
| 115 | 32 | 0.3% |
| 94 | 31 | 0.3% |
| 31 | 31 | 0.3% |
| 81 | 31 | 0.3% |
| Other values (476) | 4027 | |
| (Missing) | 5744 | 56.9% |
| Value | Count | Frequency (%) |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 4.8 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 5 |
| Value | Count | Frequency (%) |
| 619 | 1 | |
| 593 | 1 | |
| 591 | 1 | |
| 589 | 1 | |
| 572 | 1 |
| Distinct | 45 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 7453 |
| Missing (%) | 73.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 90.19569324 |
|---|---|
| Minimum | 66 |
| Maximum | 128 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 66 |
|---|---|
| 5-th percentile | 80 |
| Q1 | 84 |
| median | 88 |
| Q3 | 94 |
| 95-th percentile | 106 |
| Maximum | 128 |
| Range | 62 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.296349471 |
|---|---|
| Coefficient of variation (CV) | 0.0919816587 |
| Kurtosis | 1.478700907 |
| Mean | 90.19569324 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.8691030529 |
| Sum | 238748 |
| Variance | 68.82941455 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 88 | 348 | 3.4% |
| 86 | 336 | 3.3% |
| 90 | 324 | 3.2% |
| 84 | 277 | 2.7% |
| 100 | 200 | 2.0% |
| 92 | 193 | 1.9% |
| 80 | 154 | 1.5% |
| 96 | 142 | 1.4% |
| 94 | 134 | 1.3% |
| 82 | 99 | 1.0% |
| Other values (35) | 440 | 4.4% |
| (Missing) | 7453 | 73.8% |
| Value | Count | Frequency (%) |
| 66 | 1 | < 0.1% |
| 68 | 2 | < 0.1% |
| 70 | 8 | |
| 72 | 10 | |
| 74 | 13 |
| Value | Count | Frequency (%) |
| 128 | 1 | < 0.1% |
| 126 | 3 | < 0.1% |
| 124 | 2 | < 0.1% |
| 122 | 6 | |
| 120 | 8 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 4 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2371 | 23.5% |
| True | 4 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 2 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2373 | 23.5% |
| True | 2 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 7453 |
| Missing (%) | 73.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.51681148 |
|---|---|
| Minimum | 15 |
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 20 |
| median | 20 |
| Q3 | 20 |
| 95-th percentile | 24 |
| Maximum | 28 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.205147985 |
|---|---|
| Coefficient of variation (CV) | 0.05873953592 |
| Kurtosis | 6.537512413 |
| Mean | 20.51681148 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.178163277 |
| Sum | 54308 |
| Variance | 1.452381666 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 2082 | 20.6% |
| 22 | 395 | 3.9% |
| 24 | 127 | 1.3% |
| 18 | 13 | 0.1% |
| 26 | 8 | 0.1% |
| 28 | 7 | 0.1% |
| 17 | 5 | < 0.1% |
| 23 | 4 | < 0.1% |
| 19 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| (Missing) | 7453 | 73.8% |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 5 | < 0.1% |
| 18 | 13 | |
| 19 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 28 | 7 | 0.1% |
| 26 | 8 | 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 127 | |
| 23 | 4 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 1 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2374 | 23.5% |
| True | 1 | < 0.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 36 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2339 | 23.2% |
| True | 36 | 0.4% |
| (Missing) | 7725 | 76.5% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 7453 |
| Missing (%) | 73.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.7476388 |
|---|---|
| Minimum | 85 |
| Maximum | 140 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 85 |
|---|---|
| 5-th percentile | 90 |
| Q1 | 100 |
| median | 100 |
| Q3 | 110 |
| 95-th percentile | 120 |
| Maximum | 140 |
| Range | 55 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.443297662 |
|---|---|
| Coefficient of variation (CV) | 0.08298273806 |
| Kurtosis | 0.3391123559 |
| Mean | 101.7476388 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.5211090295 |
| Sum | 269326 |
| Variance | 71.28927541 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 1260 | 12.5% |
| 110 | 674 | 6.7% |
| 90 | 547 | 5.4% |
| 120 | 140 | 1.4% |
| 130 | 14 | 0.1% |
| 95 | 3 | < 0.1% |
| 140 | 3 | < 0.1% |
| 105 | 2 | < 0.1% |
| 115 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
| (Missing) | 7453 | 73.8% |
| Value | Count | Frequency (%) |
| 85 | 1 | < 0.1% |
| 90 | 547 | |
| 95 | 3 | < 0.1% |
| 96 | 1 | < 0.1% |
| 100 | 1260 |
| Value | Count | Frequency (%) |
| 140 | 3 | < 0.1% |
| 130 | 14 | 0.1% |
| 125 | 1 | < 0.1% |
| 120 | 140 | |
| 115 | 1 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 217 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2158 | 21.4% |
| True | 217 | 2.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 518 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 1857 | 18.4% |
| True | 518 | 5.1% |
| (Missing) | 7725 | 76.5% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 9912 |
| Missing (%) | 98.1% |
| Memory size | 79.0 KiB |
| True | 188 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 188 | 1.9% |
| (Missing) | 9912 | 98.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7725 |
| Missing (%) | 76.5% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| False | 2375 | 23.5% |
| (Missing) | 7725 | 76.5% |
| Distinct | 306 |
|---|---|
| Distinct (%) | 13.7% |
| Missing | 7874 |
| Missing (%) | 78.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.59604672 |
|---|---|
| Minimum | 21.3 |
| Maximum | 88.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 21.3 |
|---|---|
| 5-th percentile | 29.6 |
| Q1 | 33.8 |
| median | 37.8 |
| Q3 | 42.2 |
| 95-th percentile | 49.9 |
| Maximum | 88.1 |
| Range | 66.8 |
| Interquartile range (IQR) | 8.4 |
Descriptive statistics
| Standard deviation | 6.823288779 |
|---|---|
| Coefficient of variation (CV) | 0.1767872453 |
| Kurtosis | 4.512273081 |
| Mean | 38.59604672 |
| Median Absolute Deviation (MAD) | 4.2 |
| Skewness | 1.328285292 |
| Sum | 85914.8 |
| Variance | 46.55726976 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 35.4 | 24 | 0.2% |
| 32.5 | 21 | 0.2% |
| 38.7 | 21 | 0.2% |
| 36.9 | 20 | 0.2% |
| 34.1 | 20 | 0.2% |
| 33.9 | 19 | 0.2% |
| 33.6 | 19 | 0.2% |
| 38.8 | 18 | 0.2% |
| 35.8 | 18 | 0.2% |
| 38.1 | 17 | 0.2% |
| Other values (296) | 2029 | 20.1% |
| (Missing) | 7874 | 78.0% |
| Value | Count | Frequency (%) |
| 21.3 | 1 | |
| 23 | 1 | |
| 23.4 | 1 | |
| 23.7 | 2 | |
| 24.6 | 1 |
| Value | Count | Frequency (%) |
| 88.1 | 1 | |
| 78 | 1 | |
| 77.7 | 1 | |
| 76.2 | 1 | |
| 75.6 | 1 |
| Distinct | 68 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 7870 |
| Missing (%) | 77.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.56179372 |
|---|---|
| Minimum | 10.3 |
| Maximum | 40.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 10.3 |
|---|---|
| 5-th percentile | 11.3 |
| Q1 | 11.9 |
| median | 12.4 |
| Q3 | 13 |
| 95-th percentile | 14.355 |
| Maximum | 40.3 |
| Range | 30 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 1.139830063 |
|---|---|
| Coefficient of variation (CV) | 0.09073784273 |
| Kurtosis | 157.7731461 |
| Mean | 12.56179372 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 7.212666 |
| Sum | 28012.8 |
| Variance | 1.299212573 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.2 | 120 | 1.2% |
| 12.1 | 115 | 1.1% |
| 12.5 | 114 | 1.1% |
| 12.3 | 113 | 1.1% |
| 12.4 | 110 | 1.1% |
| 11.9 | 108 | 1.1% |
| 12 | 104 | 1.0% |
| 12.6 | 101 | 1.0% |
| 11.8 | 94 | 0.9% |
| 11.6 | 84 | 0.8% |
| Other values (58) | 1167 | 11.6% |
| (Missing) | 7870 | 77.9% |
| Value | Count | Frequency (%) |
| 10.3 | 1 | < 0.1% |
| 10.4 | 1 | < 0.1% |
| 10.6 | 1 | < 0.1% |
| 10.7 | 4 | < 0.1% |
| 10.8 | 10 |
| Value | Count | Frequency (%) |
| 40.3 | 1 | |
| 18.2 | 1 | |
| 17.8 | 1 | |
| 17.3 | 1 | |
| 17.2 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9173 |
| Missing (%) | 90.8% |
| Memory size | 79.0 KiB |
| False | |
|---|---|
| True | 6 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 921 | 9.1% |
| True | 6 | 0.1% |
| (Missing) | 9173 | 90.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9168 |
| Missing (%) | 90.8% |
| Memory size | 79.0 KiB |
| False | 902 |
|---|---|
| True | 30 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 902 | 8.9% |
| True | 30 | 0.3% |
| (Missing) | 9168 | 90.8% |
vomiting_level
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.0 KiB |
| nan | |
|---|---|
| 1.0 | 274 |
| 2.0 | 100 |
| 3.0 | 8 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 30300 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nan |
|---|---|
| 2nd row | nan |
| 3rd row | nan |
| 4th row | nan |
| 5th row | nan |
| Value | Count | Frequency (%) |
| nan | 9718 | |
| 1.0 | 274 | 2.7% |
| 2.0 | 100 | 1.0% |
| 3.0 | 8 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 19436 | 64.1% |
| a | 9718 | 32.1% |
| . | 382 | 1.3% |
| 0 | 382 | 1.3% |
| 1 | 274 | 0.9% |
| 2 | 100 | 0.3% |
| 3 | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29154 | 96.2% |
| Decimal Number | 764 | 2.5% |
| Other Punctuation | 382 | 1.3% |
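The character and category counts above follow directly from the value counts. As a rough sketch, assuming the four string values and counts from the frequency table (`nan`, `1.0`, `2.0`, `3.0`), they can be rebuilt with the standard library. The variable names are illustrative and not part of the report.

```python
from collections import Counter
import unicodedata

# Value counts copied from the vomiting_level frequency table.
value_counts = {"nan": 9718, "1.0": 274, "2.0": 100, "3.0": 8}

# Count every character, weighted by how often its value occurs.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())

# Group characters by Unicode general category:
# Ll = Lowercase Letter, Nd = Decimal Number, Po = Other Punctuation.
category_counts = Counter()
for ch, count in char_counts.items():
    category_counts[unicodedata.category(ch)] += count

print(total_chars)
print(char_counts.most_common())
print(dict(category_counts))
```

Because the missing entries were stringified as `nan`, the letters `n` and `a` dominate the character statistics. This suggests the column was profiled as text rather than as an ordinal level.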
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 382 | 50.0% |
| 1 | 274 | 35.9% |
| 2 | 100 | 13.1% |
| 3 | 8 | 1.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 19436 | 66.7% |
| a | 9718 | 33.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 382 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29154 | 96.2% |
| Common | 1146 | 3.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 382 | 33.3% |
| 0 | 382 | 33.3% |
| 1 | 274 | 23.9% |
| 2 | 100 | 8.7% |
| 3 | 8 | 0.7% |
Latin
| Value | Count | Frequency (%) |
| n | 19436 | 66.7% |
| a | 9718 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30300 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 19436 | 64.1% |
| a | 9718 | 32.1% |
| . | 382 | 1.3% |
| 0 | 382 | 1.3% |
| 1 | 274 | 0.9% |
| 2 | 100 | 0.3% |
| 3 | 8 | < 0.1% |
wbc
Numeric
| Distinct | 998 |
|---|---|
| Distinct (%) | 22.9% |
| Missing | 5743 |
| Missing (%) | 56.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.140460179 |
|---|---|
| Minimum | 0.72 |
| Maximum | 24.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 0.72 |
|---|---|
| 5-th percentile | 1.8 |
| Q1 | 3.2 |
| median | 4.8 |
| Q3 | 6.6 |
| 95-th percentile | 9.9 |
| Maximum | 24.25 |
| Range | 23.53 |
| Interquartile range (IQR) | 3.4 |
Descriptive statistics
| Standard deviation | 2.551745248 |
|---|---|
| Coefficient of variation (CV) | 0.4964040493 |
| Kurtosis | 1.903078458 |
| Mean | 5.140460179 |
| Median Absolute Deviation (MAD) | 1.7 |
| Skewness | 1.007880156 |
| Sum | 22396.985 |
| Variance | 6.511403811 |
| Monotonicity | Not monotonic |
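Several figures in the quantile and descriptive tables above are derived from others, so they can be cross-checked. The sketch below copies the reported numbers and assumes the non-missing count is the 10100 observations from the dataset statistics minus the 5743 missing values. The variable names are illustrative.

```python
# Figures copied from the summary tables above.
n_observations = 10100
n_missing = 5743
minimum, maximum = 0.72, 24.25
q1, q3 = 3.2, 6.6
total = 22396.985        # "Sum"
mean = 5.140460179
std = 2.551745248

n_valid = n_observations - n_missing   # rows with a recorded value
value_range = maximum - minimum        # "Range"
iqr = q3 - q1                          # "Interquartile range (IQR)"
cv = std / mean                        # "Coefficient of variation (CV)"
variance = std ** 2                    # "Variance"
mean_from_sum = total / n_valid        # should reproduce "Mean"

print(n_valid, value_range, iqr)
print(cv, variance, mean_from_sum)
```

The recomputed values agree with the table up to rounding, which also confirms that the mean is taken over non-missing rows only.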
| Value | Count | Frequency (%) |
| 2.8 | 30 | 0.3% |
| 3.2 | 28 | 0.3% |
| 3 | 28 | 0.3% |
| 3.3 | 28 | 0.3% |
| 2.4 | 26 | 0.3% |
| 3.4 | 26 | 0.3% |
| 5.3 | 26 | 0.3% |
| 3.9 | 26 | 0.3% |
| 3.6 | 26 | 0.3% |
| 5.4 | 25 | 0.2% |
| Other values (988) | 4088 | 40.5% |
| (Missing) | 5743 | 56.9% |
| Value | Count | Frequency (%) |
| 0.72 | 1 | < 0.1% |
| 0.73 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 0.77 | 1 | < 0.1% |
| 0.8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 24.25 | 1 | < 0.1% |
| 21.34 | 1 | < 0.1% |
| 18.4 | 1 | < 0.1% |
| 17.7 | 1 | < 0.1% |
| 16.92 | 1 | < 0.1% |
weight
Numeric
| Distinct | 56 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 6669 |
| Missing (%) | 66.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.78796269 |
|---|---|
| Minimum | 39 |
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 79.0 KiB |
Quantile statistics
| Minimum | 39 |
|---|---|
| 5-th percentile | 45.5 |
| Q1 | 55 |
| median | 60 |
| Q3 | 65 |
| 95-th percentile | 74 |
| Maximum | 93 |
| Range | 54 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.558127313 |
|---|---|
| Coefficient of variation (CV) | 0.1431413102 |
| Kurtosis | 0.9608959065 |
| Mean | 59.78796269 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.322841734 |
| Sum | 205132.5 |
| Variance | 73.24154311 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 320 | 3.2% |
| 62 | 224 | 2.2% |
| 57 | 214 | 2.1% |
| 53 | 158 | 1.6% |
| 63 | 151 | 1.5% |
| 68 | 143 | 1.4% |
| 55 | 141 | 1.4% |
| 65 | 140 | 1.4% |
| 51 | 123 | 1.2% |
| 50 | 121 | 1.2% |
| Other values (46) | 1696 | 16.8% |
| (Missing) | 6669 | 66.0% |
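The frequencies in this table appear to be computed against all 10100 observations, missing rows included, rather than against the non-missing rows. A small sketch with a hypothetical helper `frequency_percent` makes the difference visible for the most common value (60, count 320):

```python
n_observations = 10100   # from the dataset statistics
n_missing = 6669         # from the table above


def frequency_percent(count, total):
    """Share of `count` in `total`, as a percentage rounded to one decimal."""
    return round(100 * count / total, 1)


# Share among all rows (what the table reports).
print(frequency_percent(320, n_observations))
# Share among rows that actually have a value.
print(frequency_percent(320, n_observations - n_missing))
# Share of missing rows.
print(frequency_percent(n_missing, n_observations))
```

For a column that is about two thirds missing, the share among recorded values is roughly three times the share shown in the table.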
| Value | Count | Frequency (%) |
| 39 | 18 | 0.2% |
| 40 | 17 | 0.2% |
| 42 | 62 | 0.6% |
| 43 | 35 | 0.3% |
| 44 | 21 | 0.2% |
| Value | Count | Frequency (%) |
| 93 | 10 | 0.1% |
| 90 | 16 | 0.2% |
| 87 | 1 | < 0.1% |
| 85 | 16 | 0.2% |
| 83 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that, for a linear function, the steepness of the line (its angle to the x-axis) does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
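As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations using only the standard library. The name `pearson_r` and the sample values are illustrative and not taken from the dataset.

```python
import math


def pearson_r(xs, ys):
    """Pearson's r: covariance of xs and ys over the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)


# Illustrative values only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]

r = pearson_r(x, y)
# Shifting and rescaling each variable (with a positive factor) leaves r unchanged.
r_rescaled = pearson_r([10 * v + 3 for v in x], [0.5 * v - 7 for v in y])
print(r, r_rescaled)
```

The population form (dividing by n) is used for both the covariance and the standard deviations. The sample form (dividing by n - 1) gives the same r because the factors cancel.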
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
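A rough sketch of that definition: rank both variables, giving tied values their average rank, and apply the Pearson formula to the ranks. The helper names and the data are illustrative.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)


def spearman_rho(xs, ys):
    """Spearman's rho as Pearson's r of the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Illustrative values only: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

print(spearman_rho(x, y))  # monotonic relationship, so rho is 1 up to rounding
print(pearson_r(x, y))     # below 1 because the relationship is not linear
```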
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
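The pair-counting definition can be sketched directly. The function below implements the simple variant described here (often called τ-a), in which tied pairs count as neither concordant nor discordant. The report itself may use a tie-corrected variant, so treat this as an illustration rather than a reproduction of the report's method.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
    return (concordant - discordant) / len(pairs)


# Illustrative values only: two neighbouring swaps in an otherwise increasing sequence.
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]

print(kendall_tau_a(x, y))
```

With five observations there are ten pairs. Eight are concordant and two are discordant here, so τ is 0.6.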
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available in the Phik project documentation.
First rows
| | study_no | date | abdominal_pain_level | abdominal_tenderness | age | albumin | alt | anorexia | ascites | ast | bleeding | bleeding_gum | bleeding_mucosal | bleeding_nose | bleeding_other | bleeding_skin | bleeding_vaginal | body_temperature | bruising | chills | clinical_shock | conjunctival | convulsions | cough | creatine_kinase | dbp | diarrhoea | event_admission | event_enrolment | feeling_faint | fetal | fibrinogen | haematocrit | haematocrit_percent | haemoglobin | headache_level | hematemesis | hematuria | inr | jaundice | joint_pain_level | lethargy | liver_palpation | liver_size | lymphadenopathy | lymphocytes_percent | melaena | meningism | monocytes_percent | muscle_pain_level | nausea | neurology | neutrophils_percent | oedema | oedema_face | oedema_feet | oedema_hands | petechiae | pharyngeal | pleural_effusion | plt | pulse | rales_crackles | respiratory_distress | respiratory_rate | restlessness | rhinitis | sbp | skin_flush | skin_rash | sore_throat | spleen_palpation | tck | tq | uterus_tender | vagina_loss | vomiting_level | wbc | weight |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 03-1001 | 2017-07-08 00:00:00 | NaN | NaN | 31.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | 1.0 | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 75.0 |
| 1 | 03-1001 | 2017-07-10 00:00:00 | NaN | NaN | 31.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 75.0 |
| 2 | 03-1001 | 2017-07-10 12:30:00 | NaN | NaN | 31.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 75.0 |
| 3 | 03-1001 | 2017-07-10 13:00:00 | NaN | NaN | 31.0 | 35.2 | 15.0 | True | NaN | 21.0 | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 41.0 | NaN | NaN | NaN | NaN | NaN | NaN | 3.6 | NaN | 31.0 | 10.0 | NaN | NaN | NaN | 1.09 | NaN | NaN | NaN | NaN | NaN | NaN | 16.6 | NaN | NaN | 6.3 | NaN | NaN | NaN | 74.2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 193.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 45.0 | 13.9 | NaN | NaN | NaN | 5.09 | 75.0 |
| 4 | 03-1001 | 2017-07-11 08:00:00 | NaN | False | 31.0 | NaN | NaN | True | False | NaN | False | False | False | False | NaN | False | False | 37.2 | False | NaN | False | False | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | False | False | NaN | False | NaN | False | False | NaN | False | NaN | False | False | NaN | NaN | NaN | False | NaN | False | False | False | False | False | False | False | NaN | 90.0 | False | False | 20.0 | False | True | 110.0 | False | False | NaN | False | NaN | NaN | False | False | NaN | NaN | 75.0 |
| 5 | 03-1001 | 2017-07-11 10:00:00 | NaN | NaN | 31.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 30.7 | 10.3 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 23.3 | NaN | NaN | 6.1 | NaN | NaN | NaN | 67.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 211.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 3.95 | 75.0 |
| 6 | 03-1001 | 2017-07-12 05:00:00 | NaN | False | 31.0 | NaN | NaN | True | False | NaN | False | False | False | False | NaN | False | False | 37.2 | False | NaN | False | False | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | False | False | NaN | False | NaN | False | False | NaN | False | NaN | False | False | NaN | NaN | NaN | False | NaN | False | False | False | False | False | False | False | NaN | 90.0 | False | False | 20.0 | False | False | 110.0 | False | False | NaN | False | NaN | NaN | False | False | NaN | NaN | 75.0 |
| 7 | 03-1001 | 2017-07-12 10:40:00 | NaN | NaN | 31.0 | 34.5 | 18.0 | True | NaN | 23.0 | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 42.0 | NaN | NaN | NaN | NaN | NaN | NaN | 3.0 | NaN | 34.3 | 10.9 | NaN | NaN | NaN | 1.00 | NaN | NaN | NaN | NaN | NaN | NaN | 33.0 | NaN | NaN | 8.4 | NaN | NaN | NaN | 57.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 215.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 45.2 | 12.2 | NaN | NaN | NaN | 3.79 | 75.0 |
| 8 | 03-1001 | 2017-07-13 08:00:00 | NaN | False | 31.0 | NaN | NaN | True | False | NaN | False | False | False | False | NaN | False | False | 37.2 | False | NaN | False | False | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | False | False | NaN | False | NaN | False | False | NaN | False | NaN | False | False | NaN | NaN | NaN | False | NaN | False | False | False | False | False | False | False | NaN | 86.0 | False | False | 20.0 | False | False | 100.0 | False | False | NaN | False | NaN | NaN | False | False | NaN | NaN | 75.0 |
| 9 | 03-1001 | 2017-07-13 09:00:00 | NaN | NaN | 31.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 34.8 | 11.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 33.7 | NaN | NaN | 6.3 | NaN | NaN | NaN | 58.6 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 214.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 5.07 | 75.0 |
Last rows
| | study_no | date | abdominal_pain_level | abdominal_tenderness | age | albumin | alt | anorexia | ascites | ast | bleeding | bleeding_gum | bleeding_mucosal | bleeding_nose | bleeding_other | bleeding_skin | bleeding_vaginal | body_temperature | bruising | chills | clinical_shock | conjunctival | convulsions | cough | creatine_kinase | dbp | diarrhoea | event_admission | event_enrolment | feeling_faint | fetal | fibrinogen | haematocrit | haematocrit_percent | haemoglobin | headache_level | hematemesis | hematuria | inr | jaundice | joint_pain_level | lethargy | liver_palpation | liver_size | lymphadenopathy | lymphocytes_percent | melaena | meningism | monocytes_percent | muscle_pain_level | nausea | neurology | neutrophils_percent | oedema | oedema_face | oedema_feet | oedema_hands | petechiae | pharyngeal | pleural_effusion | plt | pulse | rales_crackles | respiratory_distress | respiratory_rate | restlessness | rhinitis | sbp | skin_flush | skin_rash | sore_throat | spleen_palpation | tck | tq | uterus_tender | vagina_loss | vomiting_level | wbc | weight |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10090 | 03-5612 | 2018-01-31 09:40:00 | NaN | NaN | 30.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 41.2 | 13.5 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 37.5 | NaN | NaN | 7.3 | NaN | NaN | NaN | 54.8 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 89.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2.61 | NaN |
| 10091 | 03-5612 | 2018-02-01 08:00:00 | NaN | False | 30.0 | NaN | NaN | True | False | NaN | False | False | False | False | NaN | False | False | 37.2 | False | NaN | False | False | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | False | False | NaN | False | NaN | False | False | NaN | False | NaN | False | False | NaN | NaN | NaN | False | NaN | False | False | False | False | False | False | False | NaN | 94.0 | False | False | 20.0 | False | False | 110.0 | False | True | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10092 | 03-5612 | 2018-02-01 10:00:00 | NaN | NaN | 30.0 | 40.1 | 86.0 | True | NaN | 128.0 | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 60.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.88 | NaN | 47.4 | 14.4 | NaN | NaN | NaN | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | 32.1 | NaN | NaN | 5.5 | NaN | NaN | NaN | 34.1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 54.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 43.9 | 12.2 | NaN | NaN | NaN | 3.47 | NaN |
| 10093 | 03-5612 | 2018-02-02 07:30:00 | NaN | False | 30.0 | NaN | NaN | True | False | NaN | False | False | False | False | NaN | False | False | 37.0 | False | NaN | False | False | False | NaN | NaN | 60.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | False | False | NaN | False | NaN | False | False | NaN | False | NaN | False | False | NaN | NaN | NaN | False | NaN | False | False | False | False | False | False | False | NaN | 88.0 | False | False | 20.0 | False | False | 100.0 | False | True | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10094 | 03-5612 | 2018-02-02 10:10:00 | NaN | NaN | 30.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 45.1 | 14.1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 33.0 | NaN | NaN | 5.1 | NaN | NaN | NaN | 32.9 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 41.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 10.03 | NaN |
| 10095 | 03-5612 | 2018-02-02 11:00:00 | 1.0 | NaN | 30.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10096 | 03-5612 | 2018-02-02 13:00:00 | NaN | NaN | 30.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | True | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10097 | 03-5612 | 2018-02-03 08:00:00 | NaN | True | 30.0 | 37.3 | 71.0 | True | False | 91.0 | False | False | False | False | NaN | False | False | 37.0 | False | NaN | False | False | False | NaN | 34.0 | 60.0 | NaN | NaN | NaN | NaN | NaN | 2.92 | NaN | 42.8 | 12.6 | NaN | False | False | 1.0 | False | NaN | False | False | NaN | False | 37.8 | False | False | 6.7 | NaN | NaN | False | 34.9 | False | False | False | False | False | False | False | 43.0 | 87.0 | False | False | 20.0 | False | False | 100.0 | False | True | NaN | False | 40.8 | 12.2 | NaN | NaN | NaN | 7.08 | NaN |
| 10098 | 03-5612 | 2018-02-04 05:00:00 | NaN | NaN | 30.0 | NaN | NaN | True | NaN | NaN | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 38.7 | 11.8 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 45.8 | NaN | NaN | 7.3 | NaN | NaN | NaN | 33.9 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 64.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 8.70 | NaN |
| 10099 | 03-5612 | 2018-02-08 08:45:00 | NaN | NaN | 30.0 | 46.2 | 93.0 | True | NaN | 53.0 | NaN | NaN | False | NaN | NaN | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 38.0 | NaN | NaN | NaN | NaN | NaN | NaN | 3.19 | NaN | 43.6 | 12.8 | NaN | NaN | NaN | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | 25.5 | NaN | NaN | 6.5 | NaN | NaN | NaN | 64.9 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 399.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 28.6 | 11.8 | NaN | NaN | NaN | 9.85 | NaN |